Solutions to a Class of Nonstandard Stochastic Control Problems with Active Learning

نویسنده

TAMER BASAR

چکیده

Abstrucf-We formulate and solve a dynamic stochastic optimization problem of a nonstandard type, whose optimal solution features active learning. The proof of optimality and the derivation of the corresponding control policies is an indirect one, which relates the original single-person optimization problem to a sequence of nested zero-sum stochastic games. Existence of saddle points for these games implies the existence of optimal policies for the original stochastic control problem, which, in turn, can be obtained from the solution of a nonlinear deterministic optimal control problem. The paper also studies the problem of existence of stationary optimal policies when the time horizon is infinite and the objective function is discounted.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nonstandard explicit third-order Runge-Kutta method with positivity property

When one solves differential equations, modeling physical phenomena, it is of great importance to take physical constraints into account. More precisely, numerical schemes have to be designed such that discrete solutions satisfy the same constraints as exact solutions. Based on general theory for positivity, with an explicit third-order Runge-Kutta method (we will refer to it as RK3 method) pos...

متن کامل

New operational matrix for solving a class of optimal control problems with Jumarie’s modified Riemann-Liouville fractional derivative

In this paper, we apply spectral method based on the Bernstein polynomials for solving a class of optimal control problems with Jumarie’s modified Riemann-Liouville fractional derivative. In the first step, we introduce the dual basis and operational matrix of product based on the Bernstein basis. Then, we get the Bernstein operational matrix for the Jumarie’s modified Riemann-Liouville fractio...

متن کامل

A Neural Network Method Based on Mittag-Leffler Function for Solving a Class of Fractional Optimal Control Problems

In this paper, a computational intelligence method is used for the solution of fractional optimal control problems (FOCP)'s with equality and inequality constraints. According to the Ponteryagin minimum principle (PMP) for FOCP with fractional derivative in the Riemann- Liouville sense and by constructing a suitable error function, we define an unconstrained minimization problem. In the optimiz...

متن کامل

Comparison between the effects of flipped class and traditional methods of instruction on satisfaction, active participation, and learning level in a continuous medical education course for general practitioners

Background and Aim: Physicians’ knowledge and capabilities decrease over time; therefore, continuous medical education is important. Flipped class is a blended teaching method that inverts instructional cycle by delivering the educational content by innovative technology out-of-class. The aim of this study was to determine the effects of flipped class on satisfaction, active participation,...

متن کامل

Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory

This paper gives an overview of linear programming methods for solving standard and nonstandard Markovian control problems. Standard problems are problems with the usual criteria such as expected total (discounted) rewards and average expected rewards; we also discuss a l~articular class of stochastic games. In nonstandard problems there are additional considerations as side constraints, multip...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Solutions to a Class of Nonstandard Stochastic Control Problems with Active Learning

نویسنده

چکیده

منابع مشابه

Nonstandard explicit third-order Runge-Kutta method with positivity property

New operational matrix for solving a class of optimal control problems with Jumarie’s modified Riemann-Liouville fractional derivative

A Neural Network Method Based on Mittag-Leffler Function for Solving a Class of Fractional Optimal Control Problems

Comparison between the effects of flipped class and traditional methods of instruction on satisfaction, active participation, and learning level in a continuous medical education course for general practitioners

Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory

عنوان ژورنال:

اشتراک گذاری